Serveur d'exploration sur l'OCR

Attention, ce site est en cours de développement !
Attention, site généré par des moyens informatiques à partir de corpus bruts.
Les informations ne sont donc pas validées.

Practical vision based degraded text recognition system

Identifieur interne : 000549 ( Main/Exploration ); précédent : 000548; suivant : 000550

Practical vision based degraded text recognition system

Auteurs : Khader Mohammad [États-Unis] ; Sos Agaian [États-Unis] ; Hani Saleh [États-Unis]

Source :

RBID : Pascal:11-0263425

Descripteurs français

English descriptors

Abstract

Rapid growth and progress in the medical, industrial, security and technology fields means more and more consideration for the use of camera based optical character recognition (OCR) Applying OCR to scanned documents is quite mature, and there are many commercial and research products available on this topic. These products achieve acceptable recognition accuracy and reasonable processing times especially with trained software, and constrained text characteristics. Even though the application space for OCR is huge, it is quite challenging to design a single system that is capable of performing automatic OCR for text embedded in an image irrespective of the application. Challenges for OCR systems include; images are taken under natural real world conditions, Surface curvature, text orientation, font, size, lighting conditions, and noise. These and many other conditions make it extremely difficult to achieve reasonable character recognition. Performance for conventional OCR systems drops dramatically as the degradation level of the text image quality increases. In this paper, a new recognition method is proposed to recognize solid or dotted line degraded characters. The degraded text string is localized and segmented using a new algorithm. The new method was implemented and tested using a development framework system that is capable of performing OCR on camera captured images. The framework allows parameter tuning of the image-processing algorithm based on a training set of camera-captured text images. Novel methods were used for enhancement, text localization and the segmentation algorithm which enables building a custom system that is capable of performing automatic OCR which can be used for different applications. The developed framework system includes: new image enhancement, filtering, and segmentation techniques which enabled higher recognition accuracies, faster processing time, and lower energy consumption, compared with the best state of the art published techniques. The system successfully produced impressive OCR accuracies (90% -to- 93%) using customized systems generated by our development framework in two industrial OCR applications: water bottle label text recognition and concrete slab plate text recognition. The system was also trained for the Arabic language alphabet, and demonstrated extremely high recognition accuracy (99%) for Arabic license name plate text recognition with processing times of 10 seconds. The accuracy and run times of the system were compared to conventional and many states of art methods, the proposed system shows excellent results.


Affiliations:


Links toward previous steps (curation, corpus...)


Le document en format XML

<record>
<TEI>
<teiHeader>
<fileDesc>
<titleStmt>
<title xml:lang="en" level="a">Practical vision based degraded text recognition system</title>
<author>
<name sortKey="Mohammad, Khader" sort="Mohammad, Khader" uniqKey="Mohammad K" first="Khader" last="Mohammad">Khader Mohammad</name>
<affiliation wicri:level="2">
<inist:fA14 i1="01">
<s1>University of Texas at San Antonio, 6900 North Loop 1604 West</s1>
<s2>San Antonio, TX</s2>
<s3>USA</s3>
<sZ>1 aut.</sZ>
<sZ>2 aut.</sZ>
<sZ>3 aut.</sZ>
</inist:fA14>
<country>États-Unis</country>
<placeName>
<region type="state">Texas</region>
</placeName>
</affiliation>
</author>
<author>
<name sortKey="Agaian, Sos" sort="Agaian, Sos" uniqKey="Agaian S" first="Sos" last="Agaian">Sos Agaian</name>
<affiliation wicri:level="2">
<inist:fA14 i1="01">
<s1>University of Texas at San Antonio, 6900 North Loop 1604 West</s1>
<s2>San Antonio, TX</s2>
<s3>USA</s3>
<sZ>1 aut.</sZ>
<sZ>2 aut.</sZ>
<sZ>3 aut.</sZ>
</inist:fA14>
<country>États-Unis</country>
<placeName>
<region type="state">Texas</region>
</placeName>
</affiliation>
</author>
<author>
<name sortKey="Saleh, Hani" sort="Saleh, Hani" uniqKey="Saleh H" first="Hani" last="Saleh">Hani Saleh</name>
<affiliation wicri:level="2">
<inist:fA14 i1="01">
<s1>University of Texas at San Antonio, 6900 North Loop 1604 West</s1>
<s2>San Antonio, TX</s2>
<s3>USA</s3>
<sZ>1 aut.</sZ>
<sZ>2 aut.</sZ>
<sZ>3 aut.</sZ>
</inist:fA14>
<country>États-Unis</country>
<placeName>
<region type="state">Texas</region>
</placeName>
</affiliation>
</author>
</titleStmt>
<publicationStmt>
<idno type="wicri:source">INIST</idno>
<idno type="inist">11-0263425</idno>
<date when="2011">2011</date>
<idno type="stanalyst">PASCAL 11-0263425 INIST</idno>
<idno type="RBID">Pascal:11-0263425</idno>
<idno type="wicri:Area/PascalFrancis/Corpus">000137</idno>
<idno type="wicri:Area/PascalFrancis/Curation">000636</idno>
<idno type="wicri:Area/PascalFrancis/Checkpoint">000103</idno>
<idno type="wicri:doubleKey">0277-786X:2011:Mohammad K:practical:vision:based</idno>
<idno type="wicri:Area/Main/Merge">000555</idno>
<idno type="wicri:Area/Main/Curation">000549</idno>
<idno type="wicri:Area/Main/Exploration">000549</idno>
</publicationStmt>
<sourceDesc>
<biblStruct>
<analytic>
<title xml:lang="en" level="a">Practical vision based degraded text recognition system</title>
<author>
<name sortKey="Mohammad, Khader" sort="Mohammad, Khader" uniqKey="Mohammad K" first="Khader" last="Mohammad">Khader Mohammad</name>
<affiliation wicri:level="2">
<inist:fA14 i1="01">
<s1>University of Texas at San Antonio, 6900 North Loop 1604 West</s1>
<s2>San Antonio, TX</s2>
<s3>USA</s3>
<sZ>1 aut.</sZ>
<sZ>2 aut.</sZ>
<sZ>3 aut.</sZ>
</inist:fA14>
<country>États-Unis</country>
<placeName>
<region type="state">Texas</region>
</placeName>
</affiliation>
</author>
<author>
<name sortKey="Agaian, Sos" sort="Agaian, Sos" uniqKey="Agaian S" first="Sos" last="Agaian">Sos Agaian</name>
<affiliation wicri:level="2">
<inist:fA14 i1="01">
<s1>University of Texas at San Antonio, 6900 North Loop 1604 West</s1>
<s2>San Antonio, TX</s2>
<s3>USA</s3>
<sZ>1 aut.</sZ>
<sZ>2 aut.</sZ>
<sZ>3 aut.</sZ>
</inist:fA14>
<country>États-Unis</country>
<placeName>
<region type="state">Texas</region>
</placeName>
</affiliation>
</author>
<author>
<name sortKey="Saleh, Hani" sort="Saleh, Hani" uniqKey="Saleh H" first="Hani" last="Saleh">Hani Saleh</name>
<affiliation wicri:level="2">
<inist:fA14 i1="01">
<s1>University of Texas at San Antonio, 6900 North Loop 1604 West</s1>
<s2>San Antonio, TX</s2>
<s3>USA</s3>
<sZ>1 aut.</sZ>
<sZ>2 aut.</sZ>
<sZ>3 aut.</sZ>
</inist:fA14>
<country>États-Unis</country>
<placeName>
<region type="state">Texas</region>
</placeName>
</affiliation>
</author>
</analytic>
<series>
<title level="j" type="main">Proceedings of SPIE, the International Society for Optical Engineering</title>
<title level="j" type="abbreviated">Proc. SPIE Int. Soc. Opt. Eng.</title>
<idno type="ISSN">0277-786X</idno>
<imprint>
<date when="2011">2011</date>
</imprint>
</series>
</biblStruct>
</sourceDesc>
<seriesStmt>
<title level="j" type="main">Proceedings of SPIE, the International Society for Optical Engineering</title>
<title level="j" type="abbreviated">Proc. SPIE Int. Soc. Opt. Eng.</title>
<idno type="ISSN">0277-786X</idno>
</seriesStmt>
</fileDesc>
<profileDesc>
<textClass>
<keywords scheme="KwdEn" xml:lang="en">
<term>Algorithm</term>
<term>Arabic</term>
<term>Arabic alphabet</term>
<term>Character recognition</term>
<term>Degradation</term>
<term>Electric power consumption</term>
<term>Filtering</term>
<term>High precision</term>
<term>Image enhancement</term>
<term>Image processing</term>
<term>Image quality</term>
<term>Implementation</term>
<term>Industrial application</term>
<term>Industrial technology</term>
<term>Learning</term>
<term>Localization</term>
<term>Optical character recognition</term>
<term>Pattern recognition</term>
<term>Performance evaluation</term>
<term>Processing time</term>
<term>Safety</term>
<term>Segmentation</term>
<term>State of the art</term>
</keywords>
<keywords scheme="Pascal" xml:lang="fr">
<term>Reconnaissance caractère</term>
<term>Technologie industrielle</term>
<term>Sécurité</term>
<term>Reconnaissance optique caractère</term>
<term>Précision élevée</term>
<term>Temps traitement</term>
<term>Evaluation performance</term>
<term>Dégradation</term>
<term>Qualité image</term>
<term>Localisation</term>
<term>Algorithme</term>
<term>Implémentation</term>
<term>Traitement image</term>
<term>Apprentissage</term>
<term>Segmentation</term>
<term>Accentuation image</term>
<term>Filtrage</term>
<term>Consommation électricité</term>
<term>Etat actuel</term>
<term>Application industrielle</term>
<term>Arabe</term>
<term>Reconnaissance forme</term>
<term>Alphabet arabe</term>
</keywords>
</textClass>
</profileDesc>
</teiHeader>
<front>
<div type="abstract" xml:lang="en">Rapid growth and progress in the medical, industrial, security and technology fields means more and more consideration for the use of camera based optical character recognition (OCR) Applying OCR to scanned documents is quite mature, and there are many commercial and research products available on this topic. These products achieve acceptable recognition accuracy and reasonable processing times especially with trained software, and constrained text characteristics. Even though the application space for OCR is huge, it is quite challenging to design a single system that is capable of performing automatic OCR for text embedded in an image irrespective of the application. Challenges for OCR systems include; images are taken under natural real world conditions, Surface curvature, text orientation, font, size, lighting conditions, and noise. These and many other conditions make it extremely difficult to achieve reasonable character recognition. Performance for conventional OCR systems drops dramatically as the degradation level of the text image quality increases. In this paper, a new recognition method is proposed to recognize solid or dotted line degraded characters. The degraded text string is localized and segmented using a new algorithm. The new method was implemented and tested using a development framework system that is capable of performing OCR on camera captured images. The framework allows parameter tuning of the image-processing algorithm based on a training set of camera-captured text images. Novel methods were used for enhancement, text localization and the segmentation algorithm which enables building a custom system that is capable of performing automatic OCR which can be used for different applications. The developed framework system includes: new image enhancement, filtering, and segmentation techniques which enabled higher recognition accuracies, faster processing time, and lower energy consumption, compared with the best state of the art published techniques. The system successfully produced impressive OCR accuracies (90% -to- 93%) using customized systems generated by our development framework in two industrial OCR applications: water bottle label text recognition and concrete slab plate text recognition. The system was also trained for the Arabic language alphabet, and demonstrated extremely high recognition accuracy (99%) for Arabic license name plate text recognition with processing times of 10 seconds. The accuracy and run times of the system were compared to conventional and many states of art methods, the proposed system shows excellent results.</div>
</front>
</TEI>
<affiliations>
<list>
<country>
<li>États-Unis</li>
</country>
<region>
<li>Texas</li>
</region>
</list>
<tree>
<country name="États-Unis">
<region name="Texas">
<name sortKey="Mohammad, Khader" sort="Mohammad, Khader" uniqKey="Mohammad K" first="Khader" last="Mohammad">Khader Mohammad</name>
</region>
<name sortKey="Agaian, Sos" sort="Agaian, Sos" uniqKey="Agaian S" first="Sos" last="Agaian">Sos Agaian</name>
<name sortKey="Saleh, Hani" sort="Saleh, Hani" uniqKey="Saleh H" first="Hani" last="Saleh">Hani Saleh</name>
</country>
</tree>
</affiliations>
</record>

Pour manipuler ce document sous Unix (Dilib)

EXPLOR_STEP=$WICRI_ROOT/Ticri/CIDE/explor/OcrV1/Data/Main/Exploration
HfdSelect -h $EXPLOR_STEP/biblio.hfd -nk 000549 | SxmlIndent | more

Ou

HfdSelect -h $EXPLOR_AREA/Data/Main/Exploration/biblio.hfd -nk 000549 | SxmlIndent | more

Pour mettre un lien sur cette page dans le réseau Wicri

{{Explor lien
   |wiki=    Ticri/CIDE
   |area=    OcrV1
   |flux=    Main
   |étape=   Exploration
   |type=    RBID
   |clé=     Pascal:11-0263425
   |texte=   Practical vision based degraded text recognition system
}}

Wicri

This area was generated with Dilib version V0.6.32.
Data generation: Sat Nov 11 16:53:45 2017. Site generation: Mon Mar 11 23:15:16 2024